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The Biotechnology Systems Branch of the Scientific and Technical Information Center 
(STIC) detected errors when processing the following CRF diskette: 



... THE ATTACHED PRINTOUT EXPLAINS THE ERRORS QET;]|QrED. 

PLEASE BE SURE TO FORWARD THIS INFORMATION TO THE APPLICANTS 
BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOtIR NEXT 
COMMUNICATION TO THE APPLICANTS ALONG WITH A NOTICE TO 
COMPLY or, 



2) CALLING APPLICANTS AND FAXING THEM A COPY OF THE PRINTOUT 
WITH A NOTICE TO COMPLY 

THIS WILL INSURE THAT THE NEXT SUBMISSION RECEIVED FROM THEM 
WILL BE ERROR FREE. 
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<110> APPLICANT: Olaf Schneewind 

Sarkis M azmanian_ 

Gwen Liu 

Hung Ton -Tha t ^ 

<120> TITLE OF INVENTION: IDENTIFICATION OF SORTASE GENE 

<130> FILE REFERENCE: 510015.213 — — ^ / 

<140> CURRENT APPLICATION NUMBER :<r 'gS7^0^7 ^-2', 4 - 3 6-J) O 'Z^f"' 
<141> CURRENT FILING DATE: 1999-04-15 " 
<160> NUMBER OF SEQ ID NOS : 36 

<170> SOFTWARE: FastSEQ for Windows Version 3.0 
<210> SEQ ID NO 1 
<211> LENGTH: 5 
<212> TYPE: PRT 
<213> ORGANISM: 
<400> SEQUENCE^ 

Leu Pro(XaajThr Gly 
1 V_/ 5 

<210> SEQ ID NO 2 
<211> LENGTH: 621 
<212> TYPE: DNA 

<213> ORGANISM: Staphylococcus aureus 
<220> FEATURE: 
<221> NAME/KEY: CDS 
<222> LOCATION: (l) . . . (621) 
<400> SEQUENCE: 2 



7. 
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ID NO 3 
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LENGTH : 
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<212> 


TYPE : PRT 
























68 


<213> 


ORGANISM: Staphylococcus aureus 
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SEQUENCE: 3 
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ID NO 4 
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ORGANISM: Actinomyces naeslundii 
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145 65 70 75 80 

146 Lys Ala Asn Asn Glu Gly Leu Met Ala Arg Leu Lys lie Pro Ser lie 

147 85 90 95 

148 Ser Leu Asp Leu Pro Val Tyr His Gly Thr Ala Asp Asp Thr Leu Leu 

149 100 105 110 

150 Lys Gly Leu Gly His Leu Glu Gly Thr Ser Leu Pro Val Gly Gly Glu 

151 115 120 125 

152 Gly Thr Arg Ser Val lie Thr Gly His Arg Gly Leu Ala Glu Ala Thr 

153 130 135 140 

154 Met Phe Thr Asn Leu Asp Lys Val Lys Thr Gly Asp Ser Leu lie Val 

155 145 150 155 160 

156 Glu Val Phe Gly Glu Val Leu Thr Tyr Arg Val Thr Ser Thr Lys Val 

157 165 170 175 

158 Val Glu Pro Glu Glu Thr Glu Ala Leu Arg Val Glu Glu Gly Lys Asp 

159 180 185 190 

160 Leu Leu Thr Leu Val Thr Cys Thr Pro Leu Gly lie Asn Thr His Arg 

161 195 200 205 

162 lie Leu Leu Thr Gly Glu Arg lie Tyr Pro Thr Pro Ala Lys Asp Leu 

163 210 215 220 

164 Ala Ala Ala Gly Lys Arg Pro Asp Val Pro His Phe Pro Trp Trp Ala 

165 225 230 235 240 

166 Val Gly Leu Ala Ala Gly Leu lie Val Val Gly Leu Tyr Leu Trp Arg 

167 245 250 255 

168 Ser Gly Tyr Ala Ala Ala Arg Ala Lys Glu Arg Ala Leu Ala Arg Ala 

169 260 265 270 

170 Arg Ala Ala Gin Glu Glu Pro Gin Pro Gin Thr Trp Ala Glu Gin Met 

171 275 280 285 

172 Arg lie Trp Met Asp Asp Asp Ala Gly Val Glu Pro Gin Arg Trp Phe 

173 290 295 300 

174 Thr Asp Leu Pro Val Pro Pro Gin Pro Ser Glu Met Glu Asn Leu Ala 

175 305 310 315 320 

176 Leu Leu Glu Glu lie Ala Ser Leu Ser Ala Pro Ser Gly Arg Trp Asp 

177 325 330 335 

178 Asp Gin Glu Leu lie Asp Thr Ala Glu lie Pro Val Leu Asp Ala Thr 

179 340 345 350 

180 Arg Pro Ser Ala Gly Thr Ser Gly Arg Thr His Arg Leu 

181 355 360 365 

182 <210> SEQ ID NO 6 

183 <211> LENGTH: 284 

184 <212> TYPE: PRT 

185 <213> ORGANISM: Enterococcus faecalis 

186 <400> SEQUENCE: 6 

187 Met Lys Ser Lys Lys Lys Arg Arg lie lie Asp Gly Phe Met lie Leu 

188 15 10 15 

189 Leu Leu lie lie Gly lie Gly Ala Phe Ala Tyr Pro Phe Val Ser Asp 

190 20 25 30 

191 Ala Leu Asn Asn Tyr Leu Asp Gin Gin lie lie Ala His Tyr Gin Ala 

192 .35 40 - 45 ' 

193 Lys Ala Ser Gin Glu Asn Thr Lys Glu Met Ala Glu Leu Gin Glu Lys 

194 50 55 60 
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90 










95 




240 




Ala 


He 


Pro 


Asp 


Leu 


Lys 


He 


Asn 


Leu 


Pro 


He 


Phe 


Lys 


Gly 


Leu 


Asp 
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100 










105 










110 






242 




Asn 


Val 


Gly 


Leu 


Thr 


Tyr 


Gly 


Ala 


Gly 


Thr 


Met 


Lys 


Asn 


Asp 


Gin 


Val 


243 








115 










120 










125 








244 




Met 


Gly 


Glu 


Asn 


Asn 


Tyr 


Ala 


Leu 


Ala 


Ser 


His 


His 


Val 


Phe 


Gly 


Met 



PAGE:- - 6 ^ VERIFICATION SUMMARY DATE: 04/28/1999 

PATENT APPLICATION US/09/292,436 TIME: 11:09:18 

Input Set: 1292436. RAW 

Line ? Error /Warning Original Text 



16 W "N" or "Xaa" used: Feature required 
63 W Line data has been corrected 



Leu Pro Xaa Thr Gly 

Val Trp Glu Lys Arg Lys lie Phe Val Ala T 



PAGE.: . . 7 ^ CORRECTION SUMMARY DATE: 04/28/1999 

PATENT APPLICATION US/09/2 92,436 TIME: 11:09:18 

Input Set: 1292436. RAW 

Line Original Text Corrected Data 



63 Val Trp Glu Lys Arg Lys lie Phe Val Ala T Val Trp-Glu Lys Arg Lys lie Phe Val Ala T 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER: 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGUSH -ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



Wrapped Nudelcs 



Wrapped Aminos 



The numbcrAext at the end of each Bno Nvrapped' down to the next Dne. 
This may occur If your file was retrfeved In a word processor after creating It 
Please adjust your right margin to ,3, as this will prevent "wrapping*. 

The amino add nunoberAext at the end of each line Vrapped ' down to the next line. 
This may occur If your file was retrieved In a word processor after creating it 
Please adjust your right margin to .3, as this will prevent Vrapping". 



Incorrect Une Length The rules require that a line not exceed 72 characters In length. This Indudes spaces. 
AH text must be visible on page. 

MisaHgned Amino Add The numbering under each 5lh amino add b mlsafigned. This may be caused by the use of labs 
Numbering between the numbering. It Is recommended to delete any labs and uses spadng between the numbers. 



10' 



Non-ASCII 



Variable tcnglh 



Wrong DcsignaUon 



Skipped Sequences 
(OLD RULES) 



Skipped Sequences 
{NEW RULES) 



Use of n*3 or Xaa's 
(NEW RULES) 



This file was not saved In ASCII (DOS) text as required by the Sequence Rules. 

Please ensure your subsequent submission Is saved In ASCII text so that it can be processed. 

Scqucncc(s) conbin 'n*s or Xaa's which represented more than one residue. 

As per the rules, each n or Xaa can only represent a single residue. 

Please present the maximum number of each residue having variable length and 

indicate in the (ix) features section that some may be missing. 

Sequcncc{s) conbin amino add or nudeic add designators which are not standard 

rcprcscnlallons as per the Sequence Rules (Please refer to paragraph 1.822) 

Sequcncc(s) missing, if IntenUonat, please use the following format for each skipped jcquence: 

(2) INFORMATION FOR S EQ ID NO:X: 

(i) SEQUENCE CHARACTERISTICS:(Do not Insert any headings under "SEQUENCE CHARACTERISTICS") 
(xl) SEQUENCE DESCRIPTION:SEQ ID NO:X: 
This sequence Is Intentionally skipped 

Please also adjust the •(HI) NUMBER OF SEQUENCES:' response to Indude the skipped sequencc(s). 

Sequcncc(s) missing. If intentional, please use the following format for each skipped sequence. 

<210> sequence Id number 
<400> sequence Id number 
000 

Use of n*s and/or Xaa's have been delected In the Sequence Listing. 
Use of <220> to <223> Is MANDATORY If n's or Xaa's are present 

In <220> to <223> section, please explain location of n or Xaa, and v^^Ich residue n or Xaa represcnU. 



11 



12 



13 



Use of <21 3>Organism Scquence{s) . 
(NEW RULES) 



are missing this mandatory field or its response. 



Use of <220>Featuro 
(NEW RULES) 



Patcntin ver. 2.0 "bug' 



Sequence(s) are missing the <220>Feature and assodated headings. 

Use of <220> to <223> Is MANDATORY If <213>ORGANISM Is •ArlifidaT or TJnknown" 
Please explain source of geneUc material In <220> to <223> section. 
(See Tederal Register/ 6/01/98. Vol. 63. No. 104. pp. 29631-32) 
(Sec. 1.823 of new Sequence Rules) 

Please do not use "Copy to Disk* function of Palentin version 2.0. This causes a corrupted 

file, resulting In missing mandaltvy numeric IdenUfiers and responses (as Indicated on raw sequence lisUng). 

Instead, please use "File Manaoer* or any olher means to copy file to fioppy disk. 



